Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 2036380 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 217.5 MiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 8 |
|---|---|
| BOOL | 3 |
| CAT | 3 |
PJ_IDADE has 34122 (1.7%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-27 19:01:02.076267 |
|---|---|
| Analysis finished | 2020-09-27 19:08:51.087202 |
| Duration | 7 minutes and 49.01 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
CPF
Real number (ℝ≥0)
| Distinct | 1133983 |
|---|---|
| Distinct (%) | 55.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.342408284e+10 |
|---|---|
| Minimum | 1163 |
| Maximum | 9.999999417e+10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 1163 |
|---|---|
| 5-th percentile | 896463468.8 |
| Q1 | 4696805638 |
| median | 1.49572958e+10 |
| Q3 | 6.381681412e+10 |
| 95-th percentile | 9.23940709e+10 |
| Maximum | 9.999999417e+10 |
| Range | 9.999999301e+10 |
| Interquartile range (IQR) | 5.912000848e+10 |
Descriptive statistics
| Standard deviation | 3.27704197e+10 |
|---|---|
| Coefficient of variation (CV) | 0.9804433485 |
| Kurtosis | -1.164674217 |
| Mean | 3.342408284e+10 |
| Median Absolute Deviation (MAD) | 1.388917766e+10 |
| Skewness | 0.6178446598 |
| Sum | 6.806413381e+16 |
| Variance | 1.073900407e+21 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 596634307 | 239 | < 0.1% | |
| 8.593706135e+10 | 167 | < 0.1% | |
| 3077961326 | 106 | < 0.1% | |
| 5.849690417e+10 | 95 | < 0.1% | |
| 1.816014681e+10 | 52 | < 0.1% | |
| 1.372061577e+10 | 47 | < 0.1% | |
| 7.504062073e+10 | 42 | < 0.1% | |
| 6400472380 | 41 | < 0.1% | |
| 9.959913929e+10 | 39 | < 0.1% | |
| 989437310 | 34 | < 0.1% | |
| 7.654310227e+10 | 34 | < 0.1% | |
| 9.315800975e+10 | 34 | < 0.1% | |
| 8.626575076e+10 | 33 | < 0.1% | |
| 1.064471367e+10 | 32 | < 0.1% | |
| 9.141596979e+10 | 32 | < 0.1% | |
| 5363279479 | 32 | < 0.1% | |
| 1103637797 | 32 | < 0.1% | |
| 5518028717 | 31 | < 0.1% | |
| 5.359502569e+10 | 31 | < 0.1% | |
| 3152493746 | 30 | < 0.1% | |
| 8110434444 | 30 | < 0.1% | |
| 1.083728474e+10 | 29 | < 0.1% | |
| 1221005731 | 29 | < 0.1% | |
| 7.732149972e+10 | 29 | < 0.1% | |
| 7.483854723e+10 | 29 | < 0.1% | |
| Other values (1133958) | 2035051 | 99.9% |
| Value | Count | Frequency (%) | |
| 1163 | 1 | < 0.1% | |
| 1910 | 1 | < 0.1% | |
| 5150 | 1 | < 0.1% | |
| 41203 | 7 | < 0.1% | |
| 44130 | 1 | < 0.1% | |
| 80527 | 1 | < 0.1% | |
| 83623 | 1 | < 0.1% | |
| 85677 | 2 | < 0.1% | |
| 88692 | 1 | < 0.1% | |
| 100226 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9.999999417e+10 | 2 | < 0.1% | |
| 9.999989713e+10 | 1 | < 0.1% | |
| 9.999972277e+10 | 3 | < 0.1% | |
| 9.999932312e+10 | 1 | < 0.1% | |
| 9.99992851e+10 | 2 | < 0.1% | |
| 9.999897127e+10 | 2 | < 0.1% | |
| 9.999891217e+10 | 1 | < 0.1% | |
| 9.999880712e+10 | 3 | < 0.1% | |
| 9.999863737e+10 | 1 | < 0.1% | |
| 9.999862927e+10 | 1 | < 0.1% |
CNPJ
Real number (ℝ≥0)
| Distinct | 1067801 |
|---|---|
| Distinct (%) | 52.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.154005683e+13 |
|---|---|
| Minimum | 455000107 |
| Maximum | 9.7711797e+13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 455000107 |
|---|---|
| 5-th percentile | 4.2227743e+12 |
| Q1 | 1.363548425e+13 |
| median | 2.1794344e+13 |
| Q3 | 2.90989375e+13 |
| 95-th percentile | 3.507963e+13 |
| Maximum | 9.7711797e+13 |
| Range | 9.7711342e+13 |
| Interquartile range (IQR) | 1.546345325e+13 |
Descriptive statistics
| Standard deviation | 1.114489914e+13 |
|---|---|
| Coefficient of variation (CV) | 0.5174034232 |
| Kurtosis | 7.19524337 |
| Mean | 2.154005683e+13 |
| Median Absolute Deviation (MAD) | 7.73911e+12 |
| Skewness | 1.341547192 |
| Sum | 4.386374092e+19 |
| Variance | 1.242087768e+26 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.379503e+12 | 103 | < 0.1% | |
| 1.0609938e+13 | 95 | < 0.1% | |
| 5.021025e+12 | 91 | < 0.1% | |
| 7.774211e+12 | 91 | < 0.1% | |
| 7.715251e+12 | 91 | < 0.1% | |
| 5.586590002e+11 | 80 | < 0.1% | |
| 2.276269e+13 | 78 | < 0.1% | |
| 1.0014536e+13 | 75 | < 0.1% | |
| 9.116860002e+11 | 74 | < 0.1% | |
| 5.605572e+12 | 73 | < 0.1% | |
| 1.6808908e+13 | 73 | < 0.1% | |
| 1.3265725e+13 | 72 | < 0.1% | |
| 1.5334477e+13 | 71 | < 0.1% | |
| 2.8016077e+13 | 68 | < 0.1% | |
| 1.2076338e+13 | 65 | < 0.1% | |
| 3.265392e+12 | 65 | < 0.1% | |
| 1.8233963e+13 | 63 | < 0.1% | |
| 1.0627791e+13 | 63 | < 0.1% | |
| 1.5427788e+13 | 63 | < 0.1% | |
| 3.4882134e+13 | 63 | < 0.1% | |
| 6.987324e+12 | 62 | < 0.1% | |
| 5.549487e+12 | 62 | < 0.1% | |
| 1.342356e+13 | 61 | < 0.1% | |
| 1.4092821e+13 | 60 | < 0.1% | |
| 9.24845e+12 | 59 | < 0.1% | |
| Other values (1067776) | 2034559 | 99.9% |
| Value | Count | Frequency (%) | |
| 455000107 | 1 | < 0.1% | |
| 3129000153 | 1 | < 0.1% | |
| 3251000120 | 1 | < 0.1% | |
| 3574000113 | 5 | < 0.1% | |
| 5058000128 | 2 | < 0.1% | |
| 6486000175 | 1 | < 0.1% | |
| 6817000177 | 1 | < 0.1% | |
| 8151000196 | 1 | < 0.1% | |
| 9290000134 | 1 | < 0.1% | |
| 1.04780001e+10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9.7711797e+13 | 1 | < 0.1% | |
| 9.7554556e+13 | 1 | < 0.1% | |
| 9.7554536e+13 | 2 | < 0.1% | |
| 9.7554451e+13 | 1 | < 0.1% | |
| 9.7554433e+13 | 1 | < 0.1% | |
| 9.7554425e+13 | 1 | < 0.1% | |
| 9.7554233e+13 | 2 | < 0.1% | |
| 9.7554202e+13 | 5 | < 0.1% | |
| 9.7554128e+13 | 2 | < 0.1% | |
| 9.7554083e+13 | 1 | < 0.1% |
PF_GENERO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 1021882 | 50.2% | |
| 0 | 1014498 | 49.8% |
PF_IDADE
Real number (ℝ≥0)
| Distinct | 109 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.08843291 |
|---|---|
| Minimum | 1 |
| Maximum | 121 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 33 |
| median | 41 |
| Q3 | 50 |
| 95-th percentile | 62 |
| Maximum | 121 |
| Range | 120 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.54921712 |
|---|---|
| Coefficient of variation (CV) | 0.2744035907 |
| Kurtosis | -0.2870915227 |
| Mean | 42.08843291 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.3744856412 |
| Sum | 85708043 |
| Variance | 133.384416 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 38 | 70964 | 3.5% | |
| 39 | 69966 | 3.4% | |
| 37 | 66711 | 3.3% | |
| 40 | 65962 | 3.2% | |
| 35 | 65794 | 3.2% | |
| 41 | 64973 | 3.2% | |
| 34 | 63981 | 3.1% | |
| 36 | 63667 | 3.1% | |
| 33 | 62684 | 3.1% | |
| 42 | 61904 | 3.0% | |
| 32 | 60616 | 3.0% | |
| 43 | 60268 | 3.0% | |
| 44 | 57937 | 2.8% | |
| 31 | 55827 | 2.7% | |
| 45 | 54627 | 2.7% | |
| 46 | 53618 | 2.6% | |
| 47 | 50661 | 2.5% | |
| 48 | 50434 | 2.5% | |
| 30 | 49817 | 2.4% | |
| 49 | 48897 | 2.4% | |
| 50 | 47899 | 2.4% | |
| 29 | 47626 | 2.3% | |
| 51 | 44738 | 2.2% | |
| 52 | 43889 | 2.2% | |
| 28 | 43234 | 2.1% | |
| Other values (84) | 609686 | 29.9% |
| Value | Count | Frequency (%) | |
| 1 | 30 | < 0.1% | |
| 2 | 57 | < 0.1% | |
| 3 | 69 | < 0.1% | |
| 4 | 35 | < 0.1% | |
| 5 | 26 | < 0.1% | |
| 6 | 45 | < 0.1% | |
| 7 | 25 | < 0.1% | |
| 8 | 36 | < 0.1% | |
| 9 | 53 | < 0.1% | |
| 10 | 41 | < 0.1% |
| Value | Count | Frequency (%) | |
| 121 | 7 | < 0.1% | |
| 120 | 4 | < 0.1% | |
| 116 | 2 | < 0.1% | |
| 110 | 5 | < 0.1% | |
| 109 | 9 | < 0.1% | |
| 104 | 3 | < 0.1% | |
| 103 | 1 | < 0.1% | |
| 102 | 2 | < 0.1% | |
| 101 | 8 | < 0.1% | |
| 100 | 9 | < 0.1% |
PJ_PORTE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
| Value | Count | Frequency (%) | |
| 1 | 1229773 | 60.4% | |
| 2 | 614166 | 30.2% | |
| 3 | 192441 | 9.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 1229773 | 60.4% | |
| 2 | 614166 | 30.2% | |
| 3 | 192441 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2036380 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1229773 | 60.4% | |
| 2 | 614166 | 30.2% | |
| 3 | 192441 | 9.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2036380 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 1229773 | 60.4% | |
| 2 | 614166 | 30.2% | |
| 3 | 192441 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2036380 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 1229773 | 60.4% | |
| 2 | 614166 | 30.2% | |
| 3 | 192441 | 9.5% |
PJ_SETOR
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 4 | 6772 |
| Value | Count | Frequency (%) | |
| 1 | 886117 | 43.5% | |
| 2 | 815471 | 40.0% | |
| 3 | 328020 | 16.1% | |
| 4 | 6772 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 886117 | 43.5% | |
| 2 | 815471 | 40.0% | |
| 3 | 328020 | 16.1% | |
| 4 | 6772 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2036380 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 886117 | 43.5% | |
| 2 | 815471 | 40.0% | |
| 3 | 328020 | 16.1% | |
| 4 | 6772 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2036380 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 886117 | 43.5% | |
| 2 | 815471 | 40.0% | |
| 3 | 328020 | 16.1% | |
| 4 | 6772 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2036380 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 886117 | 43.5% | |
| 2 | 815471 | 40.0% | |
| 3 | 328020 | 16.1% | |
| 4 | 6772 | 0.3% |
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.963640381 |
|---|---|
| Minimum | 0 |
| Maximum | 89 |
| Zeros | 34122 |
| Zeros (%) | 1.7% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 10 |
| 95-th percentile | 25 |
| Maximum | 89 |
| Range | 89 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 7.646088427 |
|---|---|
| Coefficient of variation (CV) | 0.9601247746 |
| Kurtosis | 5.459204121 |
| Mean | 7.963640381 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.108002061 |
| Sum | 16216998 |
| Variance | 58.46266823 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 239505 | 11.8% | |
| 3 | 186602 | 9.2% | |
| 1 | 172691 | 8.5% | |
| 5 | 170717 | 8.4% | |
| 4 | 168511 | 8.3% | |
| 6 | 143478 | 7.0% | |
| 7 | 138810 | 6.8% | |
| 8 | 125367 | 6.2% | |
| 10 | 122088 | 6.0% | |
| 9 | 120389 | 5.9% | |
| 11 | 42958 | 2.1% | |
| 12 | 34152 | 1.7% | |
| 0 | 34122 | 1.7% | |
| 13 | 29368 | 1.4% | |
| 14 | 23960 | 1.2% | |
| 15 | 23286 | 1.1% | |
| 16 | 21453 | 1.1% | |
| 18 | 19053 | 0.9% | |
| 17 | 19021 | 0.9% | |
| 19 | 18474 | 0.9% | |
| 20 | 17181 | 0.8% | |
| 21 | 17134 | 0.8% | |
| 23 | 15010 | 0.7% | |
| 22 | 14591 | 0.7% | |
| 24 | 13222 | 0.6% | |
| Other values (46) | 105237 | 5.2% |
| Value | Count | Frequency (%) | |
| 0 | 34122 | 1.7% | |
| 1 | 172691 | 8.5% | |
| 2 | 239505 | 11.8% | |
| 3 | 186602 | 9.2% | |
| 4 | 168511 | 8.3% | |
| 5 | 170717 | 8.4% | |
| 6 | 143478 | 7.0% | |
| 7 | 138810 | 6.8% | |
| 8 | 125367 | 6.2% | |
| 9 | 120389 | 5.9% |
| Value | Count | Frequency (%) | |
| 89 | 3 | < 0.1% | |
| 79 | 1 | < 0.1% | |
| 72 | 2 | < 0.1% | |
| 71 | 2 | < 0.1% | |
| 70 | 2 | < 0.1% | |
| 69 | 2 | < 0.1% | |
| 68 | 3 | < 0.1% | |
| 64 | 2 | < 0.1% | |
| 62 | 3 | < 0.1% | |
| 61 | 13 | < 0.1% |
PJ_NUM_FUNCIONARIOS
Real number (ℝ≥0)
| Distinct | 101 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.689320264 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 543 |
| Zeros (%) | < 0.1% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 10 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.630983411 |
|---|---|
| Coefficient of variation (CV) | 2.093831473 |
| Kurtosis | 86.76980429 |
| Mean | 2.689320264 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.688172723 |
| Sum | 5476478 |
| Variance | 31.70797417 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1466755 | 72.0% | |
| 2 | 179637 | 8.8% | |
| 3 | 78142 | 3.8% | |
| 5 | 60312 | 3.0% | |
| 4 | 53773 | 2.6% | |
| 10 | 30068 | 1.5% | |
| 6 | 29008 | 1.4% | |
| 8 | 19910 | 1.0% | |
| 7 | 17728 | 0.9% | |
| 20 | 12725 | 0.6% | |
| 9 | 11196 | 0.5% | |
| 12 | 10233 | 0.5% | |
| 15 | 9095 | 0.4% | |
| 11 | 5836 | 0.3% | |
| 13 | 4842 | 0.2% | |
| 14 | 4447 | 0.2% | |
| 30 | 3754 | 0.2% | |
| 16 | 3501 | 0.2% | |
| 19 | 3376 | 0.2% | |
| 18 | 3257 | 0.2% | |
| 17 | 2418 | 0.1% | |
| 25 | 2340 | 0.1% | |
| 22 | 1993 | 0.1% | |
| 40 | 1491 | 0.1% | |
| 21 | 1472 | 0.1% | |
| Other values (76) | 19071 | 0.9% |
| Value | Count | Frequency (%) | |
| 0 | 543 | < 0.1% | |
| 1 | 1466755 | 72.0% | |
| 2 | 179637 | 8.8% | |
| 3 | 78142 | 3.8% | |
| 4 | 53773 | 2.6% | |
| 5 | 60312 | 3.0% | |
| 6 | 29008 | 1.4% | |
| 7 | 17728 | 0.9% | |
| 8 | 19910 | 1.0% | |
| 9 | 11196 | 0.5% |
| Value | Count | Frequency (%) | |
| 100 | 639 | < 0.1% | |
| 99 | 79 | < 0.1% | |
| 98 | 33 | < 0.1% | |
| 97 | 33 | < 0.1% | |
| 96 | 25 | < 0.1% | |
| 95 | 26 | < 0.1% | |
| 94 | 5 | < 0.1% | |
| 93 | 26 | < 0.1% | |
| 92 | 51 | < 0.1% | |
| 91 | 10 | < 0.1% |
CANAL_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.672515935 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.181288277 |
|---|---|
| Coefficient of variation (CV) | 0.7062941837 |
| Kurtosis | 3.254986087 |
| Mean | 1.672515935 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.975144269 |
| Sum | 3405878 |
| Variance | 1.395441994 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1340678 | 65.8% | |
| 2 | 364113 | 17.9% | |
| 3 | 134362 | 6.6% | |
| 4 | 87844 | 4.3% | |
| 5 | 73786 | 3.6% | |
| 6 | 35597 | 1.7% |
| Value | Count | Frequency (%) | |
| 1 | 1340678 | 65.8% | |
| 2 | 364113 | 17.9% | |
| 3 | 134362 | 6.6% | |
| 4 | 87844 | 4.3% | |
| 5 | 73786 | 3.6% | |
| 6 | 35597 | 1.7% |
| Value | Count | Frequency (%) | |
| 6 | 35597 | 1.7% | |
| 5 | 73786 | 3.6% | |
| 4 | 87844 | 4.3% | |
| 3 | 134362 | 6.6% | |
| 2 | 364113 | 17.9% | |
| 1 | 1340678 | 65.8% |
TEMA_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.788784019 |
|---|---|
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.205156225 |
|---|---|
| Coefficient of variation (CV) | 0.5820221513 |
| Kurtosis | -0.4210133738 |
| Mean | 3.788784019 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3937365601 |
| Sum | 7715404 |
| Variance | 4.862713978 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 604326 | 29.7% | |
| 1 | 520002 | 25.5% | |
| 5 | 396770 | 19.5% | |
| 2 | 161892 | 7.9% | |
| 7 | 131579 | 6.5% | |
| 9 | 75860 | 3.7% | |
| 8 | 61334 | 3.0% | |
| 3 | 43901 | 2.2% | |
| 6 | 40716 | 2.0% |
| Value | Count | Frequency (%) | |
| 1 | 520002 | 25.5% | |
| 2 | 161892 | 7.9% | |
| 3 | 43901 | 2.2% | |
| 4 | 604326 | 29.7% | |
| 5 | 396770 | 19.5% | |
| 6 | 40716 | 2.0% | |
| 7 | 131579 | 6.5% | |
| 8 | 61334 | 3.0% | |
| 9 | 75860 | 3.7% |
| Value | Count | Frequency (%) | |
| 9 | 75860 | 3.7% | |
| 8 | 61334 | 3.0% | |
| 7 | 131579 | 6.5% | |
| 6 | 40716 | 2.0% | |
| 5 | 396770 | 19.5% | |
| 4 | 604326 | 29.7% | |
| 3 | 43901 | 2.2% | |
| 2 | 161892 | 7.9% | |
| 1 | 520002 | 25.5% |
ABORDAGEM_ATENDIMENTO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 0 | |
|---|---|
| 1 | 76381 |
| Value | Count | Frequency (%) | |
| 0 | 1959999 | 96.2% | |
| 1 | 76381 | 3.8% |
CATEGORIA_ATENDIMENTO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 0 | |
|---|---|
| 2 |
| Value | Count | Frequency (%) | |
| 0 | 1108433 | 54.4% | |
| 2 | 927947 | 45.6% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1108433 | 54.4% | |
| 2 | 927947 | 45.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2036380 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1108433 | 54.4% | |
| 2 | 927947 | 45.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2036380 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1108433 | 54.4% | |
| 2 | 927947 | 45.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2036380 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1108433 | 54.4% | |
| 2 | 927947 | 45.6% |
INSTRUMENTO_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.595919229 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 15.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8875848309 |
|---|---|
| Coefficient of variation (CV) | 0.5561589927 |
| Kurtosis | 1.221163122 |
| Mean | 1.595919229 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.370494125 |
| Sum | 3249898 |
| Variance | 0.787806832 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1283927 | 63.0% | |
| 2 | 367302 | 18.0% | |
| 3 | 329215 | 16.2% | |
| 4 | 35958 | 1.8% | |
| 5 | 19978 | 1.0% |
| Value | Count | Frequency (%) | |
| 1 | 1283927 | 63.0% | |
| 2 | 367302 | 18.0% | |
| 3 | 329215 | 16.2% | |
| 4 | 35958 | 1.8% | |
| 5 | 19978 | 1.0% |
| Value | Count | Frequency (%) | |
| 5 | 19978 | 1.0% | |
| 4 | 35958 | 1.8% | |
| 3 | 329215 | 16.2% | |
| 2 | 367302 | 18.0% | |
| 1 | 1283927 | 63.0% |
MEIO_ATENDIMENTO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 1810516 | 88.9% | |
| 1 | 225864 | 11.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CPF | CNPJ | PF_GENERO | PF_IDADE | PJ_PORTE | PJ_SETOR | PJ_IDADE | PJ_NUM_FUNCIONARIOS | CANAL_ATENDIMENTO | TEMA_ATENDIMENTO | ABORDAGEM_ATENDIMENTO | CATEGORIA_ATENDIMENTO | INSTRUMENTO_ATENDIMENTO | MEIO_ATENDIMENTO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.376741e+09 | 1.181879e+13 | 1 | 36 | 1 | 2 | 10 | 1 | 2 | 5 | 0 | 2 | 1 | 1 |
| 1 | 9.055791e+10 | 6.295390e+11 | 0 | 45 | 2 | 1 | 25 | 2 | 2 | 7 | 0 | 2 | 1 | 1 |
| 2 | 5.763942e+09 | 3.089587e+13 | 1 | 37 | 1 | 1 | 2 | 3 | 1 | 5 | 0 | 2 | 1 | 0 |
| 3 | 2.800365e+09 | 2.679087e+13 | 1 | 41 | 1 | 3 | 3 | 1 | 1 | 1 | 0 | 0 | 1 | 0 |
| 4 | 1.235194e+10 | 2.761945e+13 | 0 | 23 | 1 | 1 | 3 | 1 | 1 | 1 | 0 | 2 | 1 | 0 |
| 5 | 6.796476e+10 | 3.094479e+13 | 0 | 48 | 1 | 2 | 2 | 1 | 1 | 1 | 0 | 0 | 1 | 0 |
| 6 | 6.930782e+09 | 3.650945e+13 | 1 | 31 | 1 | 3 | 0 | 1 | 6 | 7 | 0 | 0 | 2 | 0 |
| 7 | 9.257545e+08 | 3.426127e+12 | 1 | 39 | 3 | 1 | 21 | 1 | 3 | 5 | 0 | 0 | 2 | 0 |
| 8 | 9.141819e+10 | 1.853110e+13 | 1 | 33 | 1 | 3 | 7 | 1 | 1 | 5 | 0 | 2 | 1 | 1 |
| 9 | 7.223612e+10 | 1.155983e+13 | 1 | 39 | 1 | 2 | 10 | 1 | 1 | 4 | 0 | 0 | 1 | 0 |
Last rows
| CPF | CNPJ | PF_GENERO | PF_IDADE | PJ_PORTE | PJ_SETOR | PJ_IDADE | PJ_NUM_FUNCIONARIOS | CANAL_ATENDIMENTO | TEMA_ATENDIMENTO | ABORDAGEM_ATENDIMENTO | CATEGORIA_ATENDIMENTO | INSTRUMENTO_ATENDIMENTO | MEIO_ATENDIMENTO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2036370 | 5.494597e+09 | 2.835837e+12 | 0 | 38 | 2 | 3 | 22 | 19 | 1 | 6 | 0 | 0 | 1 | 0 |
| 2036371 | 2.549322e+09 | 5.047121e+12 | 0 | 26 | 2 | 1 | 36 | 1 | 5 | 5 | 1 | 2 | 5 | 0 |
| 2036372 | 7.949333e+10 | 4.142210e+13 | 1 | 42 | 2 | 1 | 28 | 1 | 5 | 4 | 0 | 2 | 1 | 0 |
| 2036373 | 5.258084e+09 | 1.224097e+13 | 1 | 29 | 2 | 2 | 10 | 20 | 2 | 5 | 0 | 2 | 3 | 0 |
| 2036374 | 9.603447e+10 | 2.175042e+13 | 1 | 36 | 1 | 3 | 5 | 1 | 1 | 4 | 0 | 0 | 1 | 0 |
| 2036375 | 5.382794e+09 | 1.441622e+13 | 1 | 27 | 2 | 1 | 9 | 3 | 1 | 5 | 0 | 0 | 2 | 0 |
| 2036376 | 8.491391e+10 | 1.195691e+13 | 1 | 43 | 1 | 2 | 10 | 1 | 1 | 5 | 0 | 0 | 1 | 0 |
| 2036377 | 1.124962e+10 | 9.502510e+11 | 1 | 27 | 3 | 1 | 25 | 11 | 3 | 5 | 0 | 0 | 2 | 0 |
| 2036378 | 1.181559e+10 | 1.090873e+13 | 1 | 22 | 3 | 1 | 11 | 3 | 1 | 4 | 0 | 2 | 3 | 0 |
| 2036379 | 7.281580e+10 | 2.849556e+13 | 1 | 35 | 1 | 1 | 3 | 1 | 1 | 5 | 0 | 2 | 1 | 0 |